On Robustness/Performance Tradeoffs in Linear Programming and Markov Decision Processes

Authors

  • Huan Xu
  • Shie Mannor
Abstract

Computation of a satisfactory policy for a decision problem when the parameters of the model are uncertain is a problem encountered in many applications. The traditional robust approach is based on a worst-case analysis and may lead to overly conservative solutions. In this paper we directly quantify the robustness to uncertainty and consider the tradeoff between the nominal performance and robustness measures. Optimization in both linear programming and Markov decision processes is discussed. For linear programming we consider the tradeoff between the nominal cost of a solution and a robustness measure that quantifies the magnitude of constraint violation under the most adversarial parameters. We propose an algorithm that computes the whole set of Pareto efficient solutions based on parametric linear programming. For Markov decision processes, we consider the tradeoff between the performance under nominal parameters and the performance under adversarial parameters. For the special case where only the rewards are uncertain, we propose an algorithm that computes the whole set of Pareto efficient policies in a single pass.
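
As a rough illustration of the tradeoff between nominal and adversarial performance in the reward-uncertain case, the sketch below brute-forces a toy problem. It is not the single-pass algorithm the abstract refers to. The two-state MDP, its transitions, the nominal rewards, the deviation bounds, the discount factor, and all names are hypothetical, and only deterministic stationary policies are enumerated. It assumes each reward can deviate independently from its nominal value by at most a known amount, so the adversary's best choice is simply the lowest reward everywhere.

```python
from itertools import product

# Hypothetical toy MDP (all numbers are assumptions for illustration).
NEXT = [[0, 1], [0, 1]]              # NEXT[s][a]: deterministic next state
R_NOM = [[1.0, 0.0], [0.0, 3.0]]     # nominal reward for (state, action)
R_DEV = [[0.0, 0.0], [0.0, 2.5]]     # maximum downward deviation of each reward
GAMMA = 0.9                          # discount factor
HORIZON = 200                        # truncation of the discounted sum


def evaluate(policy, rewards, start=0):
    """Discounted return of a deterministic stationary policy from `start`."""
    total, state, discount = 0.0, start, 1.0
    for _ in range(HORIZON):
        action = policy[state]
        total += discount * rewards[state][action]
        discount *= GAMMA
        state = NEXT[state][action]
    return total


def pareto_front():
    """Return {policy: (nominal, worst_case)} for the non-dominated policies."""
    # With independent per-reward deviations, the adversarial rewards are
    # the nominal rewards minus their maximum deviation.
    r_worst = [[n - d for n, d in zip(rn, rd)] for rn, rd in zip(R_NOM, R_DEV)]
    scored = {
        policy: (evaluate(policy, R_NOM), evaluate(policy, r_worst))
        for policy in product(range(2), repeat=2)
    }

    def dominated(policy):
        pn, pw = scored[policy]
        return any(
            qn >= pn and qw >= pw and (qn > pn or qw > pw)
            for qn, qw in scored.values()
        )

    return {p: v for p, v in scored.items() if not dominated(p)}


front = pareto_front()
for policy, (nominal, worst) in sorted(front.items()):
    print(policy, round(nominal, 2), round(worst, 2))
```

In this toy instance, the policy that moves to the high-reward state does better nominally but worse under adversarial rewards than the policy that stays put, so both survive the dominance filter. Enumeration scales exponentially with the number of states, which is why a dedicated algorithm is needed for anything beyond a toy problem.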

Related resources

Robustness in portfolio optimization based on minimax regret approach

Portfolio optimization is one of the most important issues for effective and economic investment. There is plenty of research in the literature addressing this issue. Most of these pieces of research attempt to make the Markowitz’s primary portfolio selection model more realistic or seek to solve the model for obtaining fairly optimum portfolios. An efficient frontier in the ...

The Robustness-Performance Tradeoff in Markov Decision Processes

Computation of a satisfactory control policy for a Markov decision process when the parameters of the model are not exactly known is a problem encountered in many practical applications. The traditional robust approach is based on a worst-case analysis and may lead to an overly conservative policy. In this paper we consider the tradeoff between nominal performance and the worst case performance ...

Performance Analysis of Dynamic and Static Facility Layouts in a Stochastic Environment

In this paper, to cope with the stochastic dynamic (or multi-period) problem, two new quadratic assignment-based mathematical models corresponding to the dynamic and static approaches are developed. The product demands are presumed to be dependent uncertain variables with normal distribution having known expectation, variance, and covariance that change from one period to the next one, randomly...

Approximate Linear Programming for Constrained Partially Observable Markov Decision Processes

In many situations, it is desirable to optimize a sequence of decisions by maximizing a primary objective while respecting some constraints with respect to secondary objectives. Such problems can be naturally modeled as constrained partially observable Markov decision processes (CPOMDPs) when the environment is partially observable. In this work, we describe a technique based on approximate lin...

Accelerated decomposition techniques for large discounted Markov decision processes

Many hierarchical techniques to solve large Markov decision processes (MDPs) are based on the partition of the state space into strongly connected components (SCCs) that can be classified into some levels. In each level, smaller problems named restricted MDPs are solved, and then these partial solutions are combined to obtain the global solution. In this paper, we first propose a novel algorith...

Publication date: 2007